Ao le memo no., 33 


TTnl varsalltv of (p*2) Ta g systems 

Uni vers ai Turing Machine 



M, Lo Minsky 


$0® Introduction® 

This report describes (1) an improvement and great simplifica¬ 
tion of the proof that the "Tag" systems of Post can represent any 
computable process, and (2) a Universal Turing machine with just 
four symbols and seven states—the saAllest yet reported. 

Part §1 of the report gives a very simple proof of the basio 
theorem II of ref® [1], that any Turing machine has a representa¬ 
tion as a Tag system. The result is that we obtain systems with 
deletion number p * 2 (see $1 below), an improvement over the re¬ 
sult p * 6 In ref. [1]. The simplification over the complicated 
methods of [1] Is due in part to a better representation of the 
Turing maohine, suggested by Dana Soott, and in part to a system 
of Post productions which yields the p * 2 result directly. The 
result of §1 Is due jointly to John Cocke and the writer. 

The simpler structure of the production rules enabled us to 
simplify both the structure of, and the encoding for, the 6-state, 
6-symbol Universal Turing machine reported in [2]. This is des¬ 
cribed in part §2 of the report. 

Part §3 is just a note simplifying (?) the basis for recursive 

functions developed In [1]. 
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dl. Turing Machines and p * 2 Tag systems. 

An instantaneous description of a Turing machine computation 
includes the contents of the Tape, the location of the maohine on 
the tape, and the machine*s internal state. The situation can be 
represented as 
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Following a suggestion of Dana Scott, we can regard the des¬ 
cription as composed ^f four numbers} a, q 1# m * £ and 

n « Y b.2* * We assume that the Maohine is binary, so that the 
a', s and b^*s are all ones and zeros. All but a finite number of 
the t,*s find b^’a are zero, so the summations are defined. 

A Turing machine T is a set of quintuples q^.s, : j 

Suppose that d^ j happens to mean "move right". Then this quintuple 

has the following effect on {a, q^,m,nj- 
1. change m to 2m + a 


1J 


and remember b^ which 


n 


3 


if n * 2p + b Q , change n to p 
is the next symbol to be read. P is [j£] 
control is transferred to the quintuple starting with 


Kj* b o) 


That IS 


s ,»m,n 


{ q lj » n ~ 2 


n 


,2m + s 


ij 


n 

2 


shore [x] is the largest integer in x • If d^j 
then the roles of m and n are interchanged. 


is "move left" 
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The Turing Machine is thus equivalent to a programmed computer 
which executes Instructions 1^ . which perform the above operation 
on m and n « Each Instruction is followed by transfer of control 
conditional on the value (0 or 1) of the symbol read in phsse 2 
above. This computer has 2 (infinite) data registers, for m and 
n , and an instruction location register to remember the pair q^s. 
It is quite easy to represent this computer in a "bi-deletion" 
(p = 2) Post Tag system. See [1]. The system operates on a string 
of symbols which contains unary representations of the numbers m 
and n . Ihe instruction location is represented by the choice of 
letters composing that string. For eaeh instruction we will employ 
a different alphabet, distinguished by subscripts. 

At the start of the k-th instruction I. corresponding to 
{qi»sthe working string will have the form 


B lc l( Tk I> n 

where the $*s represent symbols whioh can never affect the process. 
An operation of phase (1) is represented by Tag productions of the 
form (P * 2) 
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so that 

B^(y k ) n . 

Note that we have already begun the division of n by 2. This is 
the beginning of phase (1). 







The outoome of phase (2) has to dsp®nd on whether n was even 
or odd, i.e*, on whether the new symbol b Q la sero or one* Suppose 
that for n even we ore to transfer to the k^-th instruction and 
for n odd to the kj-th instruction* Then 



is a set of productions which does what we went* To check this we 
traoe through two casesj for simplicity let the s^. associated 
with this quintuple be aero* If n is odd then n * 2p ♦ Is 


» 

- a'Kx'D 2- b'(t’) 2p+1 

- Vu ( Vk )2 “ Vk‘Vk )P 

Since the string is composed of even-length blocks, the double- 
primed letters oan never affect the process and the string is equi 


valent to 


*kjl >2m B k 1 5lT fc 1 5)P 












Gw* e*~. 
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and control is transferred to the productions governing the 

k.-th alphabet o 

For even n , we have 

Aj( lj) m E$(^) 2p 

a'IC*’!) 1 2 " B’ty’) 21 ’ 

- A ki »■(* X") 2 ” \ 1 B"(7 ki y"> P * V^O 51 ” 

and the correct new values of m and n are delivered to the pro- 

( 1 ) 

ductlons governing the kg-th a ^Pk a bet» 

The representation of a Turing maohine as a 2—register computer 
Is a variant of that In Dana Scott*s sketch^ of an elegant proof 
of Theorem I of [1]. In that paper we used a kind of second-order 
arlthmetlzatlon so that the Turing machine condition could be repre¬ 
sented in the prime factorization of a single number, John Cooke 
suggested that one ought to be able to manipulate the two quantities 
of theorem la of [1] directly* and by using the "phase-shifting" de¬ 
vice burled in the lower left of the present production diagram he 
and I were able to reduce the deletion number to 2, as shown above a 
Putting these Ideas together eliminated the two major complications 
of [1], namely the factorization In Sec, 1,2 and the division algorithm 
in Sec, 2,1 of [1], The demonstration In [1], that there are univer¬ 
sal two—tape non-writing machines* can be reconstructed easily from 

the above. 


(1) Observe that each (qjS ') -*■ 12 productions, and by seme merging* 

each q, 13 prods, J 

(2) Personal communication. 
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* k X 7 Universal T uring Maohlne. , 

Coding for the 4- * < u “ x< ---——- 

. &, *.w ot ve can make a Universal machine if 

I* follows from »1 that we can m 

■«« of an arbitrary Tag ays ,em with 

. can simulate the application of an * _ dlffer . 

aletion 2 to on arbitrary -KM— *“ 

„ 1. .ill bo reprooentod by different blocks mad. «P of the 
nt aymbo ^ of this machine 

our aymbola available to our maohlne. 

,111 be divided into three regions* 


rules region 


erased region 


sequence region 


, o i v and Bo 0 will represent the 
e will use four symbols 0, 1, X# 

, tM . ^ production* will be represented by 

ianlc squares o • , » and the working 

. „, 0 ,, end i>a in the "rules region ana 

“ 8rr8nge ” t . d bT an arrangement of X-. and B-. in 

lequence .ill be represented by an arrang 

the "sequenee region » •- be 

L .t the letter, of the alphabet of the Tag -»•*- <* 

, A and let the production rule, have the form 

X ’'"" ! A *1 . (From the ccnatruotion in ll « «“* 

1 J2° j) . % vlll reprGsent; 

X M , < If but we do not use thi. fact.) W. will P 

“ “ ' e e strings of O's and l's as follows,, 

wee in the "rules, region" as strings o 


Define 


A 3 ‘ 5 * 


3-1 J* 1 

£r(l) ■= 1 +1 + X1 


1*1 


How 


for eaoh letter Aj 


«e will have a rule string Rj 


of the form 


R 


J 


uo AjrtJ) i-«i° J2l ° 



where O n means n zeroso We concatenate these In descending order 
to form the representation of the rules: 


R 


R„R_ • oR- 
n n—J. 


© 


When a symbol is encountered in the sequenoe region, it will be ne- 
cessary for the Turing machine to locate, in the rules region, the 
corresponding rules The use of the representation a 7 for a symbol 
makes this location easy, for the value of 7C7 gives (almost) the 
number of l's that the machine must traverse to reaoh the appropri¬ 
ate rule (moving from the right)« The double 1 1 at the left marks 
the end of the rule,. The counting prooeas used in locating the 
rule does not count the left one of this pair so that it sees just 
r(l) + 1 ones in the oountlngo The symbol has one extra 0 because 

7 

the copying prooess which places the new word at the end of the se¬ 
quence region skips the first 0 of each symbol copied• 

Finally we must represent the working string in the sequenoe 
region,, This is marked as a string of X's and B f s« If the sequenoe 

is k. A, o.oA, then it is represented in the sequence region, 

K 1 k 2 k s 


from left to right now, as 


S * X 




The symbol will have £7 *= 1 and - 1 will be 0, 

This symbol must not be used in regular operation (because it will 
cause 2 consecutive B*s in region s) but, as seen below, can be 
used as a halting symbol with the production a-,-* a i a i a i ® 

Finally the "erased region", where initial parts of the working 
sequence have been deleted (we do not shift the sequenoe to the left 
between applications of the rules), is Just an interval of 0*s (blanks) 



So the entire tape haa the form 



end the maohine'a operation begins with the reading head centered 
on the first mark (an X) at the left of the sequence (S). 

Example of encoding : 

Suppose the productions are 

a l ^ *1 *1 *1 

*2 a 2 a 2 

*3 *3 *3 a 2 a 3 

Then r(l) • 3 » r(2) * 2 , r(3) * 4- # 

e^ * 3 0 8^ ® ^ ^ 83 ® 8 ^ 

and 


R r - 11010101 

Rg * 110*10 5 1 

R- « 110 8 10 5 10 8 10 8 1 . 








—9 


Operation of the Universal Turing Machine. 

When started, the machine moves backend forth counting X*s to 

the right and l«s to the left. When It encounters the first B to 

the right it will have erased the first symbol A k of S, and it 

*1 

will have located to the left the rule R,, associated with that 

- *1 

symbol. It then goes into another baok-and-forth mode which copies 

the rule word at the extreme right of S. It works from the inside 

out*which is why the sequences are written left to right in S but 

right to left in R. Each 0 in an R yields an X in S, and each 1 in 

R yields a B in S. (When the latter ooours, an extra 0 is erased 
in R, accounting for the difference of one zero in the representa¬ 
tions on the two sides.) 

When the *11* is encountered in R, copying stops and the machine 
restores the tape to its initial configuration* removing the place 
markers set up in the locate and copy phases. This restitution pro¬ 
cess* almost by magio* erases the next worcL in the sequence region 3* 
thus executing the process with deletion number 2. In the operation 
of the maohine, one will note some manipulations exchanging l*s and 
B*8; these ax’s tricks by which we mark locations without using new 
symbols. 

Below is the state transition table for the machine. The tri¬ 
ples associated with each symbol-state pair ares [ symbol written „ 
direction of motion , and new state (if different)]. The machine 
starts in state 1 on the first X in region (S). 







The following sequence demonstrates most features of the 
machine*s operation* 




START 



q„ X B XX B 


- S -*■ 


This sequence contains one genuine rule string - 110010001 - which 
when copied produces the sequence BXXBX .* The two l f s in the 
center of the example are skipped over in reading, and are there 
to illustrate how the rule is looated—precisely by skipping over 
two l»s* When the string is copied, the original is erased, so 
this example keeps cycling and never halts. 



f 
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The Halting Condition 

The machine will halt if it la in state q^ and encounter* 
a 0 • This will occur if there are two consecutive B*s in S 
encountered in the locating phase* We can define a letter Ag 
In the alphabet of the fomal system to be a Halting symbol which 
stops the generation of sequences (e.g., by providing no rule for 
operating on it). In the Turing machine representation this is 
easily represented by assigning to the symbol Ag the R-sequenoe 

110 10 1 


which will cause two B*a to be written consecutively at the end 
of S . When the machine encounters this it will stop. It is a 
little hard to Insure that this will be hit in the correct state, 
so that it may be simpler to guarantee this by writing 3 B*s p 
e.g. f by using for Ag the sequence 11010101. For this 
we should take r(H) ® 3 • It is convenient to let Ag be a^ . 


For a general dlaoussion of small Universal Machines, see [2]. 
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The Reduction of Machine Computations to Programs Involving 
Two Arithmetic Instructions Operators on Two Registerso 


Theorem I„ of (1) states that 

we can represent any partial recursive function T(n) by 
a program which operates on two Integer variables S* and 
Sp? this program la composed entirely of instructions of 
the four types 


(Al) 

(A2) 

(Bl) 

(B2) 


Add 1 to S-, 
Add 1 to Sp 

If s x > 0 , 

go to Ijj_| 

If s 2 > 0 , 

go to Ij X J 


o Co to Ij• 

o Co to I 4 « 

J 

subtract 1 fro. S x and 
otherwise go to Ij 2 <> 
subtraot 1 from Sp and 
otherwise go to I. 2 <> 


By a partial recursive function we mean, of course, a mapping 
from some integers to other Integers defined by a Turing machine 
together with an Interpretation of some of its tape configurations, 
as representations of integers,, If T(n) is such a partial function 
the cited theorem states that there is such a program which, if 
started at an initial instruction I., with set at 2“ and 

Sp set at 0 , then (if T(n) is defined) the program will halt 
at a terminal Instruction I with S^ * 2 T ^ n ^ and S 2 m 0 * 

(The question is open, and presumably negative, as to whether one 
can do better with programs of such instructions so as to eliminate 
the exponent in the representation„) 

The theorem cited above concerns programs made of four Instruction 






V 


types* We can reduce this to three . in effect by replacing 
types A2 and B2 by an exchange operation which switches 
and Sp o We oan reduce the basis to two by merging the exchange 
operation with Al and Bl as follows* 

We define new add and conditional subtract instructions: 

(A) Add 1 to S^, exchange and Sg, and go to I,* 

(B) If Sm > 0, subtraot 1 from S^, exchange S^ and 

otherwise exchange S^ and Sp 

and go to Ij 2 « 

Now consider the effect of a linear sequenoe (subroutine) of the 
fora 

A**A“*A"*B^Ij • 

This will add 1 to S^, then add 1 to 82 * then add another 1 to 
and then subtraot 1 from S 2 » The latter subtraction must occur 
since the second add assures us that Sg will not be zero at the 
time of the fourth operation* Thus the net result amounts to an 
operation of the form of the original Al, except that we have 
added 2 to S« instead of just 1 * 

Similarly* it can be seen that the subroutine AABA 


Sg, and go to j 


A-*A-*B A-*I 


J 


has the effect of A2, again with an increment of 2 Instead of 1* 


Now consider the sequenoe 


J2 


This leaves S, Invariant in any case. We make both branches from 
the seoond Instruction go to the third* Then if Sg is zero the 
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program exits from the second branch without changing anything* 

If Sp > 2 then Sg is (tocremented by 2 and the prgram leaves by the 


first branch* So 



S g hai 


this routine is 


like B2 except for decrementing by 2* Similarly 


B 5 l n 

B 3 A «* B 


3 I 


J2 


is like Bl 


If we note that the numbers are changed by two in every case. 


we see that 



8 ttdS 



system (A A A B, A A B A, A B B B, B 
old system (Al, A2, Bl* B2)» 


then the 


ABB) is iBomorphio to the 




I wish to acknowledge the value of discussions with A* R* Tritter 
in connection with the work reported here* 








IV---A 2-symbol 25-state Universal machiae. 


The diagram below gives the structure of a 2x25 machiae obtsiaed from 
the 4x7 machiae by eacodiag the symbols of the latter ia two-digit blocks: 

0-00 ; 1-01 ; X-10 ; B — 11. 

The traaslatioa from the 4x7 machiae is straightforward, with the same 
tape eacodiag. Bach of the sevea states is replaced by a three-state decodiag 
tree. la some of these trees it was possible to eacode the full quintuples, 
but ia ethers additional states were aeeded. There is no reasoa to suppose 








(1) M« Minsky, "Rsouralvs Unaolvsblliky of Post** Problsa of Tag 
Ann. ttstlu. Vdl. % 9 no. ; - 

(2) M. Mina Icy. rt Si»s and Struotuirs oflkiiwr#»lTwing Maohinss 
Using Tsg Syatwas,* AMS Synposlft on #MM<^psilM»»t.iss, Voi* $ 9 
X96I (In {tPHi)« 
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